[flang] Optimize `acospi` precision #152869

c8ef · 2025-08-09T17:04:31Z

This patch implements the solution for acospi proposed in #150452, improving precision for r16 and removing a redundant cast for r4.

c8ef · 2025-08-09T17:04:46Z

[flang] Optimize acospi precision #152869 👈 (View in Graphite)
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

llvmbot · 2025-08-09T17:05:27Z

@llvm/pr-subscribers-flang-fir-hlfir

@llvm/pr-subscribers-llvm-support

Author: Connector Switch (c8ef)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/152869.diff

3 Files Affected:

(modified) flang/lib/Optimizer/Builder/IntrinsicCall.cpp (+4-4)
(modified) flang/test/Lower/Intrinsics/acospi.f90 (+1-2)
(modified) llvm/include/llvm/Support/MathExtras.h (+2)

diff --git a/flang/lib/Optimizer/Builder/IntrinsicCall.cpp b/flang/lib/Optimizer/Builder/IntrinsicCall.cpp
index bfa470dd5690d..3e6fbafe8a6b3 100644
--- a/flang/lib/Optimizer/Builder/IntrinsicCall.cpp
+++ b/flang/lib/Optimizer/Builder/IntrinsicCall.cpp
@@ -2687,10 +2687,10 @@ mlir::Value IntrinsicLibrary::genAcospi(mlir::Type resultType,
   mlir::FunctionType ftype =
       mlir::FunctionType::get(context, {resultType}, {args[0].getType()});
   mlir::Value acos = getRuntimeCallGenerator("acos", ftype)(builder, loc, args);
-  llvm::APFloat inv_pi = llvm::APFloat(llvm::numbers::inv_pi);
-  mlir::Value dfactor =
-      builder.createRealConstant(loc, mlir::Float64Type::get(context), inv_pi);
-  mlir::Value factor = builder.createConvert(loc, resultType, dfactor);
+  llvm::APFloat inv_pi =
+      llvm::APFloat(llvm::cast<mlir::FloatType>(resultType).getFloatSemantics(),
+                    llvm::numbers::inv_pis);
+  mlir::Value factor = builder.createRealConstant(loc, resultType, inv_pi);
   return mlir::arith::MulFOp::create(builder, loc, acos, factor);
 }
 
diff --git a/flang/test/Lower/Intrinsics/acospi.f90 b/flang/test/Lower/Intrinsics/acospi.f90
index dcacd25bca480..facb57abcdbcc 100644
--- a/flang/test/Lower/Intrinsics/acospi.f90
+++ b/flang/test/Lower/Intrinsics/acospi.f90
@@ -10,8 +10,7 @@ function test_real4(x)
 ! CHECK-LABEL: @_QPtest_real4
 ! CHECK-PRECISE: %[[acos:.*]] = fir.call @acosf({{%[A-Za-z0-9._]+}}) fastmath<contract> : (f32) -> f32
 ! CHECK-FAST: %[[acos:.*]] = math.acos %{{.*}} : f32
-! CHECK: %[[dpi:.*]] = arith.constant 0.31830988618379069 : f64
-! CHECK: %[[inv_pi:.*]] = fir.convert %[[dpi]] : (f64) -> f32
+! CHECK: %[[inv_pi:.*]] = arith.constant 0.318309873 : f32
 ! CHECK: %{{.*}} = arith.mulf %[[acos]], %[[inv_pi]] fastmath<contract> : f32
 
 function test_real8(x)
diff --git a/llvm/include/llvm/Support/MathExtras.h b/llvm/include/llvm/Support/MathExtras.h
index 7bbf1b3aab761..851fe4829b675 100644
--- a/llvm/include/llvm/Support/MathExtras.h
+++ b/llvm/include/llvm/Support/MathExtras.h
@@ -74,6 +74,8 @@ constexpr float ef          = 0x1.5bf0a8P+1F, // (2.71828183) https://oeis.org/A
                 sqrt3f      = 0x1.bb67aeP+0F, // (1.73205081) https://oeis.org/A002194
                 inv_sqrt3f  = 0x1.279a74P-1F, // (.577350269)
                 phif        = 0x1.9e377aP+0F; // (1.61803399) https://oeis.org/A001622
+constexpr const char *pis     = "3.141592653589793238462643383279502884",
+                     *inv_pis = "0.318309886183790671537767526745028724";
 // clang-format on
 } // namespace numbers

tblah

Thanks for coming back to this. Please could you add a test for f128 like in your issue.

llvm/include/llvm/Support/MathExtras.h

c8ef · 2025-08-11T10:23:33Z

Please could you add a test for f128 like in your issue.

I can do this, but I'm concerned it might break the AIX build bot. It seems they use a different fp format. We may need to add separate fp numbers based on the target triple to address this? (Or simply omit the f128 test...

tblah · 2025-08-11T14:28:39Z

Please could you add a test for f128 like in your issue.

I can do this, but I'm concerned it might break the AIX build bot. It seems they use a different fp format. We may need to add separate fp numbers based on the target triple to address this? (Or simply omit the f128 test...

Try ! REQUIRES: flang-supports-f128-math in the lit test. For example see test/Lower/Intrinsics/acos_complex16.f90.

c8ef · 2025-08-11T14:49:01Z

Please could you add a test for f128 like in your issue.

I can do this, but I'm concerned it might break the AIX build bot. It seems they use a different fp format. We may need to add separate fp numbers based on the target triple to address this? (Or simply omit the f128 test...

Try ! REQUIRES: flang-supports-f128-math in the lit test. For example see test/Lower/Intrinsics/acos_complex16.f90.

I'm actually referring to the following case:

#include "llvm/ADT/APFloat.h"
#include "llvm/Support/raw_ostream.h"

int main() {
  llvm::APFloat pi(llvm::APFloat::IEEEquad(),
                   "0.318309886183790671537767526745028724");
  pi.print(llvm::errs());
  llvm::APFloat pi_ppc(llvm::APFloat::PPCDoubleDouble(),
                       "0.318309886183790671537767526745028724");
  pi_ppc.print(llvm::errs());
}
// 0.318309886183790671537767526745028737
// 0.31830988618379067153776752674503

They have different rounding, so an exact match is difficult. I chose to match the common prefix instead. It should work on all build bots now.

tblah

LGTM. Thanks for being so careful not to break the build bots.

c8ef · 2025-08-11T16:15:05Z

Merge activity

Aug 11, 4:15 PM UTC: Graphite rebased this pull request as part of a merge.
Aug 11, 4:18 PM UTC: Graphite rebased this pull request as part of a merge.
Aug 11, 4:21 PM UTC: Graphite rebased this pull request as part of a merge.
Aug 11, 4:26 PM UTC: Graphite rebased this pull request as part of a merge.

c8ef marked this pull request as ready for review August 9, 2025 17:04

llvmbot added flang Flang issues not falling into any other category llvm:support flang:fir-hlfir labels Aug 9, 2025

c8ef requested review from tblah and vzakhari August 9, 2025 17:08

tblah reviewed Aug 11, 2025

View reviewed changes

llvm/include/llvm/Support/MathExtras.h Show resolved Hide resolved

c8ef force-pushed the users/c8ef/_flang_optimize_acospi_precision branch from 34920d8 to bcdd26a Compare August 11, 2025 14:45

c8ef requested a review from tblah August 11, 2025 14:50

tblah approved these changes Aug 11, 2025

View reviewed changes

c8ef force-pushed the users/c8ef/_flang_optimize_acospi_precision branch from bcdd26a to 5920e94 Compare August 11, 2025 16:14

c8ef force-pushed the users/c8ef/_flang_optimize_acospi_precision branch 2 times, most recently from 1080d68 to ed4ff68 Compare August 11, 2025 16:21

[flang] Optimize acospi precision

36f6e4b

c8ef force-pushed the users/c8ef/_flang_optimize_acospi_precision branch from ed4ff68 to 36f6e4b Compare August 11, 2025 16:26

c8ef merged commit 7b41c2f into main Aug 11, 2025
7 of 9 checks passed

c8ef deleted the users/c8ef/_flang_optimize_acospi_precision branch August 11, 2025 16:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[flang] Optimize `acospi` precision #152869

[flang] Optimize `acospi` precision #152869

Uh oh!

c8ef commented Aug 9, 2025 •

edited

Loading

Uh oh!

c8ef commented Aug 9, 2025

Uh oh!

llvmbot commented Aug 9, 2025 •

edited

Loading

Uh oh!

tblah left a comment

Uh oh!

Uh oh!

c8ef commented Aug 11, 2025

Uh oh!

tblah commented Aug 11, 2025

Uh oh!

c8ef commented Aug 11, 2025

Uh oh!

tblah left a comment

Uh oh!

c8ef commented Aug 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

[flang] Optimize acospi precision #152869

[flang] Optimize acospi precision #152869

Uh oh!

Conversation

c8ef commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

c8ef commented Aug 9, 2025

Uh oh!

llvmbot commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tblah left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

c8ef commented Aug 11, 2025

Uh oh!

tblah commented Aug 11, 2025

Uh oh!

c8ef commented Aug 11, 2025

Uh oh!

tblah left a comment

Choose a reason for hiding this comment

Uh oh!

c8ef commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Uh oh!

[flang] Optimize `acospi` precision #152869

[flang] Optimize `acospi` precision #152869

c8ef commented Aug 9, 2025 •

edited

Loading

llvmbot commented Aug 9, 2025 •

edited

Loading

c8ef commented Aug 11, 2025 •

edited

Loading